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^\ Abstract 

o ■ 

Changes in the extent of local concavity and surface roughness of bind- 
•r-t . 

. ing sites of proteins have long been considered as useful markers to study 

' functional sites of proteins. However, an objective algorithm that de- 

scribes the connection between the simultaneous changes of these impor- 
tant parameters, eludes the students of structural biology. Here we pro- 
pose a simple yet rigorous algorithm that attempts to achieve the same. A 
generalized index is constructed here that not only describes the process 
but also reduces the estimation error in local shape characterization. The 
proposed methodology is easier to implement with computational tool-set. 
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Introduction : 

The concavity of the active sites of enzymes and the changes in their local geom- 
etry has long been recognized as key features to study their structure-function 
relationship (Lewis 1991; Pettit et al. 2007). Alongside this recognition, came 
the acceptance that change in surface roughness of patch of protein surface can 
serve as an equally useful marker, at least when attempting to find functional 
sites of that protein (Lewis and Rees 1985; Pettit and Bowie 1999). While 
there are works that successfully quantify important aspects of geometry of lo- 
cal shape of protein exterior (Laskowski et al. 1996; Peters et al. 1996; Liang 
et al. 1998; Brady and Stouten 2000); an algorithm that relates the local shape 
change to change of roughness of surface patch that holds this shape, is ex- 
tremely difficult to find. The reason for this conspicuous absence can probably 
be attributed to the complex nature of functional dependencies that infiuence 
surface roughness of any given patch of protein surface. It is owing to these 
(seemingly innumerable) dependencies that the surface roughness appears to 
be an n— dimensional random variable. In this paper, we propose a simple yet 
rigorous algorithm that transforms the surface roughness description from that 
of a n— dimensional random variable to a system of simple algebraic equations. 
Further, we define a generalized index that captures the simultaneous changes in 
local geometry and the roughness of the patch of protein surface that engulfs it. 
It can immediately be recognized that the algebraic description of this compli- 
cated dependency problem is much easily implementable in computational form 
than the same in the realm of n— dimensional continuous random variables. 

Methodology : 

Since the entire phenomenon of enzyme- inhibitor interaction (like most of the bi- 
ological interactions) is time-dependent and context-dependent, the local shape 
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of active site of enzyme is expected to undergo certain small yet significant 
change in its topography. Further, since any shape is necessarily coated with a 
surface, a minute change in the shape can account for a subtle change in the 
magnitude of surface roughness (at least potentially) too. Hence to describe 
the situation objectively, let [s = 0, 1, 2, . . . , r] be an estimate of the local shape 
of the active site of an enzyme when it is undergoing interaction with some 
inhibitor protein; and let [F — (/i, /2, . . . , /„)] be the corresponding vector of 
the state representing surface roughness (expressed with fractal dimensions) of 
the patch of the surface containing the aforementioned local shape. We define 
a function (piF), on which the functional Z = \\(j){F) — s\\ attains a minimum. 
Since the magnitude of the functional Z = (F) — s\\ has the capability to de- 
scribe a change in either </> (F) or s; the minimum magnitude of the functional 
will imply a state where the extent of change in (f) (F) follows the trend in extent 
of change in s, in the closest terms. Hence it is this minimum magnitude of the 
functional Z that we define as generalized index for studying the change in local 
shape of the enzyme active site, because only with this magnitude of functional 
Z, the change in both shape and surface roughness can be simultaneously de- 
scribed in the best objective manner. 

It is clear that the solution of the give problem is determined by the choice 
of the norm ||.||. To describe the situation from an unbiased perspective, we 
have considered F as an n— dimensional continuous random variable with corre- 
lation matrix C. We could then define (j) {F) as a predictor of a random variable 
s having a minimum mean square value. Thus the functional Z was re-written 
as : 
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Z = J2psE {^{F)-sf\s 



(1) 



s=0 



where E represented the expectation operator and Ps denoted the probability 
of occurrence of any arbitrarily chosen state of the shape, s. In the case where 
E {(j) {P) |s) = s for all [s = 0, 1, . . . , r], the given functional state attains a min- 
imum. 

To describe the situation from a bottom-up perspective, i.e., describing with 
respect to an individual fractal dimension (fk) that represented a particular 
value of surface roughness; we had : E (/fe|s) = s for all s and [fc = 0, 1, . . . , r]. 
Wc assume here that the form of the relations E{fi\s) and the coefficients a^, 
(i = 1, 2, n) remain unchanged in time. 

It was then possible to represent (j) {F) as a linear combination of indices : 



i=l 

where cti = ^\ since in this case E {(j) (F) \s) = s. The coefScients ai, 0:2, . . . , a, 
were evaluated from the minimum condition of the Lagrangian : 



Differentiating Z{a,X) with respect to ai {i = 1,2, ...,n) and equating the 
derivatives to zero we obtain a system of algebraic equations in a and A : 



n 




(2) 




(3) 
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J2aiJ2PsE[ifk-s)ifi-s)\s] + X = (4) 



i=l 



where {k = 1,2,..., n) and '^cti = 1. 

In case where C is the diagonal matrix, the solution of the given system was 
written as : 



and 



further, for the variance of the generahzed index we get 

E{cl>{F)-sf=(j27^] (7) 



\k—l 



Since Ckk, {k = 1, 2, n), are positive and bounded, we arrived at : 



limE{(j){F) - sf =0. (8) 



n— >oo 



Eq"-8 shows unambiguously how the introduction of the generalized index could 
reduce the estimation error in local shape characterization and how such an in- 
dex made it possible to transform the problem from the complex analysis of an 
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n— dimensional variable to that of involving scaler parameters. This particular 
aspect of the present work makes it easily compatible with the computational 
tool-kit. It is achieved without compromising with the necessary rigor, essen- 
tial to describe the intricate (time-dependent and context-dependent) coupling 
between simultaneous changes of two variables with large dependencies. 

In a special case, the regressions E{fk\s) = ipk{s) , (k = 1, 2, n) can 
be non-linear. This can be reduced to the one considered by transforming re- 
gressive relations to a linear form; for example, with the help of the polynomials 
M/s ifk) , {k = 1,2, ...,n); so that E [fXk (fk) \s] = s. Then the generalized index 
will be linear combination of polynomials : 

Conclusion : 

However, simple and useful as this technique is, it is not immune to limitations. 
For example, the algorithm proposed in the current work might not be efficient 
when applied to cases where functional sites are not appreciably concave. Pro- 
gressing on similar line thinking, since the protein-protein interaction interface 
is known to be planer (Jones and Thornton 1997), the scope of the application 
of an algorithm like the one proposed, might be restricted. Having said that, 
since for many proteins, the functional sites are known to be containing con- 
cave shapes (in the form of clefts and pockets) (Laskowski et al. 1996), the 
spectrum of contexts where the present algorithm can be applied assumes an 
imprcssivo rango. The form of simple algebraic equations, with which the final 
description of this complex biophysical interaction is expressed, makes it easily 
implementable with elementary computational apparatus. 
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